Predicting hyperarticulate speech during human-computer error resolution

نویسندگان

  • Sharon L. Oviatt
  • Margaret MacEachern
  • Gina-Anne Levow
چکیده

When speaking to interactive systems, people sometimes hyperarticulate— or adopt a clarified form of speech that has been associated with increased recognition errors. The goals of the present study were: (1) to establish a flexible simulation method for studying users’ reactions to system errors, (2) to analyze the type and magnitude of linguistic adaptations in speech during human-computer error resolution, (3) to provide a unified theoretical model for interpreting and predicting users’ spoken adaptations during system error handling, and (4) to outline the implications for developing more robust interactive systems. A semi-automatic simulation method with a novel error generation capability was developed to compare users’ speech immediately before and after system recognition errors, and under conditions varying in error base-rate. Matched original-repeat utterance pairs then were analyzed for type and magnitude of linguistic adaptation. When resolving errors with a computer, it was revealed that users actively tailor their speech along a spectrum of hyperarticulation, and as a predictable reaction to their perception of the computer as an "at risk" listener. During both low and high error rates, durational changes were pervasive, including elongation of the speech segment and large relative increases in the number and duration of pauses. During a high error rate, speech also was adapted to include more hyper-clear phonological features, fewer disfluencies, and change in fundamental frequency. The two-stage CHAM model (Computer-elicited Hyperarticulate Adaptation Model) is proposed to account for these changes in users’ speech during interactive error resolution.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Hyperarticulate Speech during Human-computer Error Resolution1

When speaking to interactive systems, people sometimes hyperarticulate— or adopt a clarified form of speech that has been associated with increased recognition errors. The goals of the present study were: (1) to establish a flexible simulation method for studying users’ reactions to system errors, (2) to analyze the type and magnitude of linguistic adaptations in speech during human-computer er...

متن کامل

Predicting hyperarticulate speech during resolution ’ human - computer error

When speaking to interactive systems, people sometimes hyperurticulate or adopt a clarified form of speech that has been associated with increased recognition errors. The goals of the present study were (1) to establish a flexible simulation method for studying users’ reactions to system errors, (2) to analyze the type and magnitude of linguistic adaptations in speech during human-computer erro...

متن کامل

Modeling global and focal hyperarticulation during human-computer error resolution.

When resolving errors with interactive systems, people sometimes hyperarticulate--or adopt a clarified style of speech that has been associated with increased recognition errors. The primary goals of the present study were: (1) to provide a comprehensive analysis of acoustic, prosodic, and phonological adaptations to speech during human-computer error resolution after different types of recogni...

متن کامل

Modeling hyperarticulate speech during human-computer error resolution

Hyperarticulate speech to computers remains a poorly understood phenomenon, in spite of its association with elevated recognition errors. The present research analyzes the type and magnitude of linguistic adaptations that occur when people engage in error resolution with computers. A semi-automatic simulation method incorporating a novel error generation capability was used to collect speech da...

متن کامل

Linguistic adaptations during spoken and multimodal error resolution.

Fragile error handling in recognition-based systems is a major problem that degrades their performance, frustrates users, and limits commercial potential. The aim of the present research was to analyze the types and magnitude of linguistic adaptation that occur during spoken and multimodal human-computer error resolution. A semiautomatic simulation method with a novel error-generation capabilit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Speech Communication

دوره 24  شماره 

صفحات  -

تاریخ انتشار 1998